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ABSTRACT 

Syncytin-1 and -2, human fusogenic glycoproteins 
encoded by the env genes of the endogenous retro- 
viral loci ERVWE1 and ERVFRDE1, respectively, 
contribute to the differentiation of multinucle- 
ated syncytiotrophoblast in chorionic villi. In non- 
trophoblastic cells, however, the expression of 
syncytins has to be suppressed to avoid potential 
pathogenic effects. We studied the epigenetic sup- 
pression of ERVWE1 and ERVFRDE1 5-long 
terminal repeats by DNA methylation and chromatin 
modifications. Immunoprecipitation of the provirus- 
associated chromatin revealed the H3K9 trimethyl- 
ation at transcriptionally inactivated syncytins in HeLa 
cells. qRT-PCR analysis of non-spliced ERVWE1 
and ERVFRDE1 mRNAs and respective env mRNAs 
detected efficient splicing of endogenously ex- 
pressed RNAs in trophoblastic but not in non- 
placental cells. Pointing to the pathogenic potential 
of aberrantly expressed syncytin-1, we have found 
deregulation of transcription and splicing of the 
ERVWE1 in biopsies of testicular seminomas. 
Finally, ectopic expression experiments suggest the 
importance of proper chromatin context for the 
ERVWE1 splicing. Our results thus demonstrate 
that cell-specific retroviral splicing represents an 



additional epigenetic level controling the expression 
of endogenous retroviruses. 

INTRODUCTION 

Human endogenous retroviruses (HERVs) form a sub- 
stantial part of the human genome. Despite the huge 
number of HERV copies, only a few intact open reading 
frames (ORFs) survived, avoiding the gradual accumula- 
tion of compromising mutations and/or deletions. Out of 
18 intact env genes identified among various HERV 
families (1,2), four were shown to encode bona fide 
fusogenic retroviral envelope glycoproteins (1). The 
ERVWE1 provirus encodes a full-length envelope 
product dubbed syncytin-1, whose transmembrane (TM) 
subunit induces cell-to-cell fusion during differentiation of 
placental syncytiotrophoblast (3,4). Binding of the surface 
(SU) syncytin-1 subunit to the human sodium-dependent 
neutral amino acid transporters hASCTl and hASCT2 is 
the first step of a complex cell fusion process (5). 
Syncytin-2 encoded by the provirus ERVFRDE1 is spe- 
cifically expressed in the placenta and also displays 
fusogenic activity (6-8) in dependence on the carbohy- 
drate transducer MFSD2 (9). Abnormalities of placental 
development and eclamptic disorders in pregnancy are 
often accompanied by altered syncytin-1 expression 
(10,11), whereas syncytin-2 acts rather as an 
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immunosuppressor (12) and has been associated with 
abnormal trophoblast differentiation in placenta affected 
by trisomy 21 (13). 

A high transcriptional activity of HERV elements has 
been detected in various healthy human tissues analyzed 
by a retro virus-specific microarray (14) and quantitative 
RT-PCR (1). Expression of syncytins, however, is re- 
stricted to the placental trophoblast and only a weak ex- 
pression was observed in the testes (3,4,6,15). Aberrant 
expression of syncytins outside of the placenta could 
trigger pathogenic changes, particularly chronic inflam- 
mation and inadvertent cell-to-cell fusions. Indeed, 
syncytin-1 has been detected in astrocytes and activated 
glial cells in the brain of multiple sclerosis patients (16,17). 
Syncytin-1 immuno reactivity has also been found in breast 
carcinomas and in higher-grade colorectal carcinomas 
(18,19). Cell-to-cell fusion in tumors may contribute to 
aneuploidy and promote gradual development of malign- 
ancies (20). Accordingly, syncytin-1 was shown to be 
involved in the fusion of breast cancer and endothelial 
cells (18). The hASCTl and two receptors are widely ex- 
pressed in many tissues, particularly in the liver and 
brain (21), and do not limit the fusogenic effect of 
syncytin-1. The MFSD2 receptor, in contrast, is expressed 
in the placenta and to a lesser extent in the testis and 
intestine (9). 

Expression of syncytins must, therefore, be tightly 
controlled in order to avoid pathogenic fusions in 
non-placental tissues. The trophoblast-specific expression 
of syncytin-1 is controlled in part by the cyclic 
AMP-responsive ERVWE1 5'-LTR and in part by an 
upstream regulatory element (URE) of ~400bp contain- 
ing the binding sites for AP2, Spl and Glial Cell Missing a 
(GCMa), the principal transcriptional factor of syncytin-1 
and inductor of syncytiotrophoblast differentiation 
(22,23). Despite the tissue specificity of the URE, weak 
basal expression from ERVWE1 LTR can be observed 
after transient transfection into non-placental cells (23) 
and GCMa is not essential for this promoter activity 
(22). Therefore, the stringent suppression of syncytin-1 
expression requires involvement of other mechanisms. 
We have shown that CpG methylation of ERVWE1 
5'-LTR occurs in non-placental tissues and cell lines but 
not in trophoblastic cells from the placenta or choriocar- 
cinoma Be Wo cell line (24). These results were recently 
confirmed and also extended to ERVFRDE1 (25). 
Furthermore, CpG methylation of the ERVWE1 5'-LTR 
in non-placental cells is strongly resistant to demethyl- 
ation by a DNA methyltransferase inhibitor and the 
reporter gene expression driven by ERVWE1 5'-LTR is 
efficiently suppressed by CpG methylation in transient ex- 
pression assay (24). The whole picture of ERVWE1 sup- 
pression is, however, more complicated. At least in the 
testes, ERVWE1 transcripts have been detected (4) 
without any pathology. Because HERVs are known to 
keep the retroviral type of RNA splicing where only the 
spliced env mRNA serves for translation of the enve- 
lope glycoprotein, it is possible that ERVWE1 and 
ERVFRDE1 splicing adds another regulatory level to 
syncytins expression. The presence of two HERV-W- 
specific mRNA forms was previously detected in human 



placenta by northern blotting (4). Further analysis 
demonstrated the usage of two 5'-splice signals and three 
3'-splice signals in the provirus ERVWE1 and splicing-out 
the 2.1 kb intron like insert (26). This splicing strategy of 
ERVWE1 has been conserved among all apes (27). The 
splicing of ERVFRDE1 utilizes the predicted splice site 
signals in the 5'-LTR and upstream of the env gene as 
described by sequencing of the 5'-RACE amplicon (8). 

In the present study, we assessed the levels of genomic 
non-spliced and subgenomic spliced env mRNAs of both 
ERVWE1 and ERVFRDE1 in placental and non- 
placental cells by qRT-PCR. We demonstrate here that 
inefficient splicing together with epigenetic silencing by 
CpG methylation and inhibitory histone modifications 
suppress the expression of syncytins in non-placental 
cells. Based on these results, we suggest that the expression 
of endogenous retroviral glycoproteins in the mammalian 
genome can be efficiently controlled by tissue-specific 
pre-mRNA splicing. 

MATERIALS AND METHODS 

Tissue samples 

All samples of healthy testicular tissues and testicular 
tumors were obtained from the Department of Urology 
of the Third Faculty of Medicine, Charles University in 
Prague. Biopsies taken from patients with hormone- 
sensitive prostate cancer treated by bilateral orchiectomy 
represented healthy testicular tissue (Tl-h, T2-h) with 
prevalence of spermiogenous epithelium from which the 
tunica albuginea was discarded. Histological inspection 
showed normal testicular tissue with decreased spermio- 
genesis because of the advanced age of these patients. Six 
testicular tumors were included to this study; histologi- 
cally they are seminomas (T3-s, T4-s, T5-s and T7s), one 
of them with a big component of granulomatous 
multinuclear cells (T3-s), mixed germinal tumor with em- 
bryonal carcinoma and teratoma components (T-6t) and 
testicular lymphoma (T8-t). Histologically normal testicu- 
lar tissues surrounding the tumors were obtained by 
macrodissection (T5-h, T6-h, T7-h and T8-h). Human 
ethics approval was granted by the Committee for 
Ethics, Handling Recombinant DNA and Clinical 
Research at the Institute of Molecular Genetics, and 
informed consents were obtained from the patients. 

Cell culture procedures 

Choriocarcinoma BeWo and JEG-3 cells were maintained 
in F-12 and MEM-D media mixed 1:1 (Sigma) supple- 
mented with 1% NaHC0 3 , 10% FBS, 300(ig/ml glutam- 
ine, a mix of penicillin and streptomycin (100 ug/ml each), 
and 2.5mg/ml amphotericin B at 37°C in humified atmos- 
phere of 5% C0 2 . HeLa cells were grown under the same 
conditions with the FBS supplement of 5%. Activation of 
Be Wo and JEG-3 cells was done with 40 uM forskolin 
(Sigma) for 48 h. Transfections of 2x10 JEG-3 and 
HeLa cells seeded on the six-well plates were done with 
FuGENE HD lipofection reagent (Roche) according to 
the manufacturer's recommendation. Stable transfection 
of JEG-3 and HeLa cells was done with 0.2 ug of 
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pcDNA3 plasmid DNA mixed with a 10-fold excess of 
cloned ERVWE1 DNA (see below). The pcDNA3 
plasmid DNA alone was used for mock-transfection. 
Two days after transfection, the cells were selected for 
the resistance to G418 and expanded single cell clones 
were assayed in splice-specific RT-PCR after 3 weeks. 

Cloning of the full-length ERVWE1 provirus 

The 10 900 bp fragment of genomic placental DNA con- 
taining the ERVWE1 provirus together with the URE at 
the 5'-end was amplified using the Phusion High-Fidelity 
DNA Polymerase (Finnzymes) according to the manufac- 
turer's instructions. Primers and PCR conditions are given 
in Supplementary Data. The PCR fragment was ligated 
into the pGEM T-easy vector (Promega), verified by re- 
striction digestion and partial sequencing. 

Bisulfite analysis of cytosine methylation 

Isolation of DNA samples, sodium bisulfite treatment, 
PCR and sequencing of PCR clones were performed es- 
sentially as described previously (24). See Supplementary 
Data for the primers for semi-nested PCR of ERVFRDE1 
from bisulfite-treated DNAs. We analyzed 6-13 independ- 
ent PCR clones with at least 95% conversion of cytosine 
outside of CpG dinucleotides. 

Chromatin immunoprecipitation assays 

Chromatin immunoprecipitation (ChIP) assays were per- 
formed using the ChIP assay kit (Upstate Biochemicals, 
Temecula, CA). BeWo and HeLa cell lines, either non- 
transfected or stably transfected with cloned ERVWE1 
proviral DNA, were cross-linked in the presence of 1% 
formaldehyde for lOmin at room temperature. Chromatin 
was sheared by pulse sonication at 40% amplitude for 
6min to obtain DNA fragments of ~200-500bp in a 
Digital Sonifier S-450D (Branson, Danbury, CT). The 
antibodies used for ChIP were as follows: anti-acetyl- 
histone H3 (Lys 9), anti-trimethyl-histone H3 (Lys 9), 
anti-trimethyl-histone H3 (Lys 36) (ab4441, ab8898 and 
ab9050, respectively, all from Abeam, Cambridge, UK) 
and anti-histone H3 (07-690 Millipore). For the control, 
we immunoprecipitated the cell extracts using normal 
rabbit IgG (Millipore-Upstate) instead of hyperimmune 
antibody. Immunoprecipitated chromatin was subjected 
to RT-PCR quantification using primers specific for the 
transcription start site in the 5'-LTR of the proviruses 
ERVWE1 and ERVFRDE1, for the central part of the 
ERVWE1 intron, for the 5'-end of the ERVWE1 exon, 
and for the GAPDH gene (see Supplementary Data for 
the sequence of primers). The amount of immunopre- 
cipitated material was normalized to the input DNA. 
The enrichment values relative to the input were low but 
in accordance with values shown for the respective 
antibodies by the manufacturer. 

Splice-specific quantitative RT-PCR 

Total RNA was isolated by using the Tri Reagent 
(Sigma) according to manufacturer's instructions. One ug 
of total RNA was DNase-treated for 1 5 min and reverse 



transcribed in a total volume of 50 ul, with M-MLV 
Reverse Transcriptase (Gibco) and random hexamers for 
priming. One ul of the resulting cDNA was then used for 
the quantitative PCR based on the MESA GREEN qPCR 
MasterMix Plus for SYBR Assay Kit (Eurogentec) and a 
Chromo4 system for RT-PCR detection (Bio-Rad). 
Quantifications were performed for the ERVWE1 non- 
spliced and spliced transcripts, the ERVFRDE1 non- 
spliced and spliced transcripts, and for the RNA 
polymerase Ha (POLR2A) as a housekeeping reference 
gene. The volume of the reaction mixture was 20 ul with 
400 nM final concentration of each primer (see the 
Supplementary Data for the sequence of primers). 
External standards were constructed by PCR using 
BeWo cDNA and transcript-specific primer sets. 
Resulting PCR fragments of ERVWE1 and ERVFRDE1 
were cloned into the pGEM-TEasy (Promega) and verified 
by sequencing. Ten-fold serial dilutions of external 
standard plasmids containing the genomic or spliced 
cDNA of either ERVWE1 or ERVFRDE1, ranging from 
10 to 10 6 copies per reaction, were used for construction of 
the calibration curves. Ten-fold serial dilutions of plasmid 
containing the fragment of the human POLR2A gene, 
ranging from 10 to 10 6 copies per reaction, were used for 
the construction of POLR2A calibration curve. We 
normalized the numbers of RNA copies of both proviruses 
placed in the reaction according to the estimated number of 
POLR2A transcripts. Cycling conditions for ERVWE1 
and POLR2A were 5 min at 95° C, 45 cycles of 15 s at 
95°C, 20 s at 55°C, 30 s at 60°C. Cycling conditions for 
ERVFRDE1 and POLR2A were 5 min at 95° C, 45 cycles 
of 15 s at 95°C, 60 s at 60°C. The negative controls included 
water as a template. All quantitative RT-PCRs were per- 
formed in triplicate. The specificity of the PCR products 
amplified was confirmed by melting curve analysis and by 
sequencing the PCR products. To assess the amount of 
contaminating exogenous DNA, either genomic or 
plasmid, we included reactions run in the absence of the 
reverse transcriptase as negative controls. The background 
values of these negative controls were subtracted from the 
results of repective reactions with reverse transcriptase. 



RESULTS 

The histone modifications of active and silenced ERVWE1 
and ERVFRDE1 loci 

Post-translational histone modifications, particularly 
acetylation or trimethylation of histone H3 lysine 9 
(H3K9), are signatures of active transcription or gene re- 
pression (28). To investigate this epigenetic level of tran- 
scriptional regulation, we compared H3K9 modifications 
associated with ERVWE1 and ERVFRDE1 5'-LTRs by 
ChIP in choriocarcinoma BeWo cells and cervical carcin- 
oma HeLa cells, which show transcriptional activity or 
gene repression of both promoters, respectively. H3 
acetylation was highly enriched in the 5'-LTR region of 
ERVWE1 in BeWo cells, but was negligible in HeLa cells. 
In contrast, trimethylation of H3K9 was extremely low in 
BeWo cells but highly enriched in HeLa cells (Figure 1A). 
The levels of H3 acetylation surrounding the 5'-LTR of 
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ERVFRDE1 were low in both HeLa and BeWo cells. 
Trimethylation of H3K9 was high in HeLa cells and low 
in BeWo cells (Figure IB). These results suggest that in 
BeWo cells, unlike ERVFRDE1, ERVWE1 is highly ex- 
pressed, whereas both proviruses are efficiently repressed 
in HeLa cells. The level of H3K9 trimethylation correlates 
perfectly with the CpG methylation in the 5'-LTR region 
of both proviruses (24,25). The constitutively expressed 
GAPDH gene did not display any significant differences 
in H3K9 modifications between BeWo and HeLa cells 
(Figure 1C). 

Quantitative analysis of ERVWE1 and ERVFRDE1 
proviral expression and splicing 

Next, we examined whether the expression levels of both 
syncytin loci correlate with the epigenetic status of their 
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Figure. 1. Acetylation and trimethylation of H3K9 in LTRs of 
syncytin-1 and -2 in BeWo and HeLa cells. ChIP assays were performed 
with antibodies against indicated histone modifications and the 
precipitated chromatin was subjected to qPCR with primers specific 
for (A) 5'-LTR of ERVWE1, (B) 5'-LTR of ERVFRDE1 and 
(C) coding region of human GAPDH, a constitutively expressed 
control gene. The results were normalized to IgG non-specific 
controls and shown as an average percentage of the input DNA. 
Data are presented as a mean ± SD from three triplicates. 



promoters. Splicing of the full-length genomic transcript 
into subgenomic env mRNA is necessary for the transla- 
tion into a cleavable envelope glycoprotein. We, therefore, 
used the splice- specific qRT-PCR to assess the efficiency 
of splicing in BeWo cells, the best model of trophoblast 
differentiation, and in HeLa cells. The localization of 
splice signals in ERVWE1, the position of primers and 
the size of amplified products are shown in Figure 2A. 
In BeWo cells, we detected the presence of the full-length 
non-spliced transcript and retention of the 2.1 kb itron- 
like insert. The ratio of spliced and non-spliced transcripts 
was 6:1, and the expression of the spliced syncytin-1 tran- 
script reached at least 130% of that of the housekeeping 
standard POLR2A. HeLa cells produced a very low level 
of the non-spliced ERVWE1 transcript and spliced tran- 
scripts did not raise above the background level (Figure 
2B). This corresponds to the lack of syncytin-1 expression 
as previously detected by northern blotting and 
non-quantitative PCR (22,24). Minor DNA contamin- 
ation was tightly controlled by including the 
RT-parallels (Figure 2C). We also detected the presence 
of pseudo-spliced mRNA without the 2. 1 kb intron-like 
insertion in BeWo cells. 

The splicing strategy of the ERVFRDE1 provirus and 
positions of primers for splice-specific RT-PCR are shown 
in Figure 2D. BeWo cells expressed the spliced transcript 
of ERVFRDE1 at much lower levels than ERVWE1 and 
reached only 2% of POLR2A expression level, while the 
ratio of spliced over non-spliced transcript was similarly 
high at 7:1. Again, the transcription of ERVFRDE1 in 
HeLa cells was low: the non-spliced transcript expression 
reached 0.35% of POLR2A expression, which was com- 
parable with BeWo cells, and the spliced transcript was 
not detected (Figure 2E). In conclusion, the transcription 
of ERVWE1 and ERVFRDE1 reflects the epigenetic 
status of their LTRs, and the cell-specific RNA splicing 
acts as an additional regulatory mechanism in the control 
of syncytin-1 and -2 expression. 

The expression of syncytin-1 and cell-to-cell fusion effi- 
ciency were shown to be augmented by forskolin, adenyl- 
ate cyclase activator. In our splice-specific qRT-PCR 
assay, stimulation of BeWo cells with forskolin increased 
ERVWE1 transcription with the ratio of spliced over 
non-spliced transcripts 5:1. There was only weak, statis- 
tically insignificant forskolin stimulation in JEG-3 cells 
(Supplementary Figure SI A). Even more striking was 
the shift of ERVFRDE1 transcription and splicing in 
BeWo cells, which increased up to 25% of POLR2A, 
and the ratio of spliced over non-spliced transcript was 
40:1 (Supplementary Figure SIB). No effect of the 
forskolin treatment was observed in HeLa cells (data not 
shown). 

Transcription and splicing of ERVWE1 and ERVFRDE1 
in the testes and germ cell tumors 

A weak signal of syncytin-1 mRNA was previously de- 
tected in the testes and testicular cancer (4,29). 
Therefore, the testes might be an organ where the tran- 
scriptional suppression of syncytin loci by CpG methyla- 
tion and chromatin modifications is leaky. In this tissue, 
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Figure. 2. Expression and splicing of ERVWE1 and ERVFRDE1 proviruses in trophoblastic and non-placental cells. Splicing strategies of (A) 
ERVWE1 and (D) ERVFRDE1 are shown together with the positions of primers for splicing-specific PCR (arrows) and the sizes of diagnostic PCR 
products. The levels of non-spliced (open columns) and spliced (black columns) transcripts of (B) ERVWE1 and (E) ERVFRDE1 in BeWo and 
HeLa cells were estimated by qRT-PCR and are shown as the average percentage of the POLR2A expression ± SD from three triplicates. The low 
level of genomic DNA contaminating the RT-PCR product of non-spliced ERVWE1 mRNA in BeWo cells is shown (C). Lane 1, marker of 
molecular size; lane 2, non-spliced ERVWE1 mRNA RT + ; lane 3, non-spliced mRNA RT~; lane 4, spliced mRNA RT + ; lane 5, spliced mRNA RT~; 
lane 6, marker of molecular size. 



the additional expression control due to differential splic- 
ing, reported above, would be of biological importance. In 
order to explore this possibility, we performed splice- 
specific qRT-PCR of ERVWE1 and ERVFRDE1 se- 
quences in total RNA samples prepared from healthy 
testes. As shown in Figure 3A, the level of non-spliced 
ERVWE1 transcripts reached only between 1% and 6% 
of the POLR2A transcripts with a hardly detectable pres- 
ence of the spliced transcript. The transcription of 
ERVFRDE1 in the testes was even weaker and reached 
~1% of POLR2A transcripts with roughly equimolar 
amounts of non-spliced and spliced transcripts 
(Figure 3B). In addition to two biopsies of healthy testes, 
we analyzed the expression of ERVWE1 in several biopsies 
from six different testicular germ cell tumors (Figure 3A). 
In several samples of four seminomas, the expression 
levels of both non-spliced and spliced transcripts were 
highly increased in contrast to normal tissues and compar- 
able or even higher by several fold to POLR2A expression. 



The ERVWE1 expression was variable in biopsies taken 
from different parts of the same tumor. The splicing was 
efficient in all tumor samples with a ratio of spliced over 
non-spliced transcripts varying between 1.9 and 4.0. In 
two samples of non-seminoma testicular cancer, we did 
not observe any significant increase of either ERVWE1 
expression or splicing. 

Previous studies showed the correlation between tran- 
scriptional suppression and promoter hypermethylation of 
ERVWE1 in non-placental cells (24,25). We, therefore, 
assayed the methylation status of 5'-LTR regions of the 
ERVWE1 and ERVFRDE1 (Figure 3C and D) proviruses 
in healthy testes and testicular germ cell tumors using the 
sodium bisulfite technique. We designed the forward 
primers for semi-nested PCR complementary to the 
unique genomic sequences adjacent to the 5'-LTRs to 
ensure the selective amplification of ERVWE1 and 
ERVFRDE1 loci (Figure 3C and D). In two samples of 
healthy testicular tissues, the ERVWE1 5'-LTRs were 
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Figure. 3. Transcription, splicing and CpG methylation of ERVWE1 and ERVFRDE1 in the healthy testes and testicular germ cell tumors. 
The levels of non-spliced (open columns) and spliced (black columns) transcripts of (A) ERVWE1 and (B) ERVFRDE1 in two samples 
of healthy testes, T2-h and T9-h, in two samples of granulomatous seminoma 3, T3-sl and T3-s2, three samples of seminoma 4, T4-sl, T4-s2 
and T4-s3, in tumor samples and adjacent healthy tissues of seminoma 5 and 7, T5-s, T5-h, T7-s and T7-h, in tumor sample and adjacent healthy 
tissue of mixed germ cell tumor 6, T6-t and T6-h and in tumor sample and adjacent healthy tissue of testicular lymphoma 8, T8-t and T8-h 

(continued) 
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found to be heavily methylated (Figure 3C) in all analyzed 
clones. The overall percentage of methylated CpGs was 
>90%, even higher than that in other non-placental 
tissues or cell lines reported in our previous study (24). 
None out of 20 sequenced clones was hypomethylated. 
The 5'-LTRs of ERVFRDE1 provirus displayed a 
slightly lower level of CpG methylation reaching 87 or 
58% of methylated CpGs in two samples of healthy testes 
(Figure 3D). The observed increase of the ERVWE1 ex- 
pression in seminomas was accompanied with a slight 
hypomethylation of its 5'-LTR varying between 38% 
and 91% of methylated CpGs (Figure 3C). There is, 
however, no correlation between the level of CpG 
demethylation and the ERVWE1 expression and we did 
not find completely unmethylated 5'-LTR copies in all 
seminoma samples expressing ERVWE1. No hypomethyl- 
ation was observed in non-seminoma germ cell tumor 
samples, which do not express ERVWE1 (Figure 3C). 
We conclude that 5'-LTRs of ERVWE1 and 
ERVFRDE1 are methylated in the testes and that the 
weak expression of syncytin-1 in healthy testes can be 
observed despite this epigenetic status. The inefficient 
splicing of ERVWE1 transcripts prevents improper 
syncytin-1 expression in normal testes but not in semi- 
nomas, where both non-spliced and spliced forms of 
ERVWE1 mRNAs occur. 

Splicing of ectopically expressed ERVWE1 mRNA 

The inefficient splicing of ERVWE1 in the testes and 
HeLa cells suggests the regulatory potential of HERV 
RNA processing. In order to test it experimentally, we 
quantified the spliced and non-spliced transcripts over- 
expressed ectopically from a non-methylated full-length 
ERVWE1 proviral clone containing the URE at the 
5'-end. The whole ERVWE1 provirus, amplified by PCR 
and cloned into a plasmid, was stably transfected into 
JEG-3 and HeLa cells and the resulting transcripts 
were quantified by splice-specific qRT-PCR. As shown 
in Figure 4A, the ectopic transcription of the exogenously 
introduced ERVWE1 provirus in two stably transfected 
clones of JEG-3 cells was two to four times higher 
than the transcription of the endogenous ERVWE1 in 
mock-transfected JEG-3 cells. The splicing of endogenous 
transcripts was efficient, comparable with BeWo cells 
(Figure 2B), and the level of non-spliced transcript did 
not increase significantly in transfected cells, suggesting 
that the exogenous copies integrated in different 
genomic locations are prone to transcription and splicing 
in the cells of trophoblastic origin. In a parallel experi- 
ment, two independent HeLa cell clones, ERVWE1A 
and ERVWE1B, with stable expression of ERVWE1 
from ectopically integrated proviruses displayed less effi- 
cient splicing with only slight excess of spliced over 



non-spliced transcripts (Figure 4B). Although the stable 
ectopic expression of ERVWE1 in HeLa cells was much 
lower than the endogenous expression in JEG-3 cells and 
reached just 20 and 4% of the POLR2A, the efficiency of 
splicing from the exogenous copies was in striking contrast 
to the absence of splicing in mock-transfected HeLa cells. 
Altogether, these results indicate that the integration of 
the unmethylated ERVWE1 provirus into ectopic 
chromosomal positions and different chromatin context 
alleviates the blocks of transcription and splicing that 
are present at the endogenous ERVWE1 locus. Both tran- 
scription and splicing, however, lack the positive 
tissue-specific control and their efficiencies do not reach 
the values observed in trophoblastic JEG-3 cells. 

H3K36me3 at the ERVWE1 intron/exon boundary 

Among the histone post-translational modifications, the 
trimethylation of lysine 36 of H3 (H3K36me3) has been 
reported to be enriched on exons and to mark the ex- 
pressed exons (30-32). Differences in H3K36me3 levels 
were also shown at actively used alternative exons of the 
FGFR2 gene (33). We, therefore, compared the 
H3K36me3 at the central part of intron versus 5'-part of 
the ERVWE1 exon, i.e. the beginning of the syncytin-1 
coding sequence, in BeWo and HeLa cells. Because the 
differences in H3K36me3 described in ChIP analyses of 
single genes or in ChlP-on-chip experiments are low and 
reflect the nucleosome density (34), we compared the oc- 
cupancy of total H3 at the same sites and normalized the 
levels of H3K36me3. As shown in Figure 5 A, the H3 oc- 
cupancy of the ERVWE1 in BeWo cells was similarly low 
at both the intron and the exon. In HeLa cells, there was, 
however, significantly increased H3 occupancy at the in- 
tron and low at the 5'-end of the exon. The relative level of 
H3K36 in BeWo cells was similarly high along the intron/ 
exon boundary (Figure 5B). In contrast, the relative level 
of H3K36me3 in HeLa cells increased 7-fold along this 
boundary, reaching in the exon approximately the same 
level as observed in BeWo cells (Figure 5B). We further 
tested the H3 occupancy and H3K36me3 level in two in- 
dependent HeLa cell clones with stable insertions of 
ectopic ERVWE1. Here, we took into account the incre- 
ased copy numbers of ERVWE1, up to five proviruses 
introduced into unknown genomic positions. The value 
of H3 occupancy at the intron in the first clone, 
ERVWE1A, was comparable with intact HeLa cells and 
provided the increased proviral copy number, we can 
expect the presence of proviruses with low, BeWo-like 
H3 occupancy (Figure 5A). This corresponds with 
basal expression and splicing of ERVWE1 in this clone 
(Figure 4B). The second clone, ERVWE1B, displayed a 
very high value of H3 enrichment and provided the 
increased proviral copy number, we expected that in this 



Figure. 3. Continued 

were estimated by qRT-PCR and are shown as the average percentage of the POLR2A expression ± SD from three triplicates. The samples of 
healthy testes and non-seminoma tumors are shown in the inset with different scale of the j-axis. The CpG methylation status of the 5'-LTR U3 
region of (C) ERVWE1 and (D) ERVFRDE1 in three samples of healthy testes, Tl-h, T2-h and T9-h, and in aforementioned samples of seminomas 
and non-seminoma tumors was examined by the bisulfite sequencing technique. Methylated CpG sites are indicated by solid circles, unmethylated 
CpG sites are indicated by open circles, numbers in parentheses depict the percentage of methylated CpG dinucleotides. 
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Figure. 4. mRNA splicing of ectopically expressed ERVWE1 in JEG-3 (A) and HeLa (B) cells. The pcDNA3 plasmid DNA alone was used for 
mock-transfected control. Expression of non-spliced (open bars) and spliced (solid bars) ERVWE1 mRNA from de novo integrated proviral DNA 
was quantified in two independent cell clones of both JEG-3 and HeLa cells by splice- specific qRT-PCR in two independently repeated experiments. 
Data are presented as the average percentage of the POLR2A expression ± SD from three parallels. 
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Figure. 5. H3 occupancy (A) and H3K36me3 (B) in the intron and 5'-exon end of ERVWE1 in BeWo and HeLa cells. Using the anti-H3K36me3 
and anti H3 antibodies we immunoprecipitated chromatin from BeWo and HeLa cells and analyzed the amount of immunoprecipitated intron 
fragment (open bars) and 5'-exon end (solid bars) by qPCR. HeLa cells were either intact or with stably integrated ectopic ERVWE1 (ERVWE1 A, 
B). (A) The anti-H3CT antibody was used to quantify the total H3 associated with intron and 5'-exon end of ERVWE1 as an average percentage of 
the input DNA. IgG non-specific controls are shown (gray bars) as an average percentage of the input DNA. Data are presented as a mean ± SD 
from three triplicates. (B) The enrichment for H3K36 was normalized to H3 occupancy and the ratio of H3K36me3/total H3 in intron (open bars) 
and 5'-exon end (solid bars) of ERVWE1 is shown. 
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case most proviruses display increased H3 occupancy com- 
parable with intact HeLa cells (Figure 5A). Accordingly, 
the ERVWE1 expression in this clone was low and only 
slightly increased over the intact HeLa cells (Figure 4B). 
The values of H3 occupancy at the 5'-end of the exon were 
low in both clones, only slightly increased in comparison 
with intact HeLa cells, which corresponds with the 
increased pro viral copy number (Figure 5A). Finally, the 
relative level of H3K36me3 at the intron differed in these 
two clones, being substantially higher in the clone 
ERVWE1A and reaching ~30% of H3K36me3 density 
of BeWo cells (Figure 5B). Altogether, we propose that 
efficient expression and splicing of the ERVWE1 pro virus 
are accompanied by a high level of H3K36me3 at the 
intron and at the intron/exon boundary. In contrast, 
high H3 occupancy at the intron but not at the subsequent 
exon indicates transcriptionally silent provirus and ineffi- 
cient splicing. 



DISCUSSION 

The critical point in cell differentiation is stringent re- 
gulation of tissue-specific genes, which restricts gene ex- 
pression only to the correct cell type and correct 
developmental stage. In this study, we compared the 
mechanisms of epigenetic control, i.e. CpG methylation, 
chromatin modifications, and splicing of two HERVs 
encoding syncytins, fusogenic retroviral glycoproteins 
involved in the development of placenta. It was shown 
previously (24,25) that CpG methylation-dependent tran- 
scriptional suppression of syncytins might be the general 
regulatory mechanism protecting non-placental tissues 
from inadvertent cell-to-cell fusion. We add now that sup- 
pressive chromatin modifications strengthen this control. 
CpG methylation of retroviral LTRs has already been 
coupled with transcriptional suppression of e.g. Rous 
sarcoma virus (RSV), murine leukemia virus, human im- 
munodeficiency virus type 1 (HIV-1), human T-cell 
leukemia virus, and HERV-K (35-40). Heavy CpG 
methylation of ERVWE1 in non-placental cells is accom- 
panied by deacetylation and trimethylation at H3K9, 
which results in a strong resistance against reactivation 
by 5-azacytidine and trichostatin A (24). Similar epigen- 
etic marks and resistance were described in the latent res- 
ervoir of human immunodeficiency virus (37) and in long- 
term silenced retroviral vectors (41). Transcriptional 
suppression of ERVWE1 is, however, leaky in the testes 
and the tight control of splicing prevents formation of 
translatable syncytin-1 mRNA and expression of its 
protein product. 

Promoter methylation of ERVWE1 and ERVFRDE1 
does not correlate with their weak but detectable tran- 
scription in the testes (4,6). This might be caused by 
some testis-specific and male germ line-specific epigenetic 
mechanisms of transcriptional control leading to increased 
transcription noise (42). Transcription of HERV-Ws in 
the germ line, at least in the past, is reflected by the 
presence of numerous HERV-W-derived processed 
pseudogenes in the human genome (43,44). Furthermore, 
very low expression of ERVWE1 was also observed in 



HeLa cells and lymphoblastic MCF cells with heavily 
methylated 5'-LTR (Figure 2, Supplementary Figure S2). 
Taken together, epigenetic suppression of syncytins is not 
absolute and additional mechanisms, e.g. splicing and pro- 
cessing of ERVWE1 mRNA, must be involved in their 
control. 

Splicing of genomic ERVWE1 mRNA was already 
shown to be inefficient in the testes as only the pseudo- 
spliced form could be seen on the Northern blot (4). 
Smallwood et al. (45) demonstrated the increasing level 
of pseudo-spliced ERVWE1 mRNA in term placenta 
and suggested that the relative amounts of spliced and 
pseudo-spliced mRNAs could regulate syncytin-1 expres- 
sion during pregnancy. Finally, Gimenez et al. (29) found 
increased levels of spliced ERVWE1 mRNA in several 
testicular tumors. Based on these findings, we focused 
on splice-specific quantitative analysis of ERVWE1 and 
ERVFRDE1 expression in trophoblastic and non- 
placental cells. We describe here a very stringent control 
of splicing at both syncytin loci, which, in the case of 
ERVWE1, is very robust under extremely high ectopic 
expression of the cloned provirus. Importantly, we have 
found deregulation of splicing in biopsies of seminomas. 
In contrast to previous studies (4,26,29,46), we detected 
the full-length and non-spliced ERVWE1 transcript. We 
controlled the DNA contamination in our splice-specific 
qRT-PCR and sequenced the amplified product. These 
controls confirm that our detection of non-spliced tran- 
script does not come from the genomic DNA or from 
other HERV-W members. We can only speculate why 
this transcript has not been detected so far, and further 
research will be needed to fully explain the splicing 
strategy of ERVWE1 and the system of choice of the 
spliced fragment length. 

In comparison with the results by Gimenez et al. (29), 
we observed only a slight demethylation of 5'-LTR of 
ERVWE1 in seminomas with released ERVWE1 expres- 
sion and splicing. The comparison of both studies is, 
however, difficult because of the small number of tumor 
samples and various degrees of differentiation. The weak 
correlation between ERVWE1 expression and hypo- 
methylation of its 5'-LTR could be explained by excep- 
tional epigenetic features of male germ cells such as high 
level of CpG methylation, presence of rare histone 
variants, protamine occupancy, mimicking methylcytosine 
by hydroxymethylcytosine, etc. We analyzed whether the 
deregulation of transcription and splicing in testicular 
cancer was associated with the expression of GCMa, but 
we did not detect any GCMa mRNA using RT-PCR 
(data not shown). The deregulation of syncytin-1 splicing 
probably occurs not only in testicular germ cell tumors, 
particularly seminomas, but also in other various human 
cancers, e.g. in breast carcinomas (18) and endometrial 
carcinomas (46). 

The regulatory potential of retroviral splicing generally 
results from the need in infectious retroviruses to balance 
between genomic mRNA and spliced subgenomic 
mRNA(s) and produce the right proportion of protein 
component assembling into new functional virions. 
HIV-1 translates its own fran.y-activator, Tat, from a 
spliced mRNA transcript and downregulation of HIV-1 
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splicing is observed during the provirus persistence in 
memory cells (47). Another example of specific splicing 
control is the suppression of env splicing of RSV in mam- 
malian cells (48). It was suggested that mammalian cells 
lack chicken- specific trans factor(s) binding to cis regula- 
tory sequences, which could explain the suppression of env 
splicing (48-50), and the non-permissiveness of these cells 
for RSV replication (51). In replication-defective HERVs, 
the balance of non-spliced and spliced transcripts is not 
important any more but the cis splice signals can be 
adopted for cell-specific control of splicing and expression 
of retroviral glycoproteins. Cell-specific splicing in en- 
dogenous retroviruses remains to be studied systematically 
but in the light of our recent findings, it might be a useful 
indicator of biological activity of HERVs with intact env 
genes, which do not fulfill the criteria for sensu stricto 
syncytins. Blaise et al. (52) identified a new four-member 
family of HERVs, HERV-P(b), with one full-length env 
gene, ZFERV-like env, fusogenic in HeLa cells. Its expres- 
sion was observed in many healthy tissues without any 
significant specificity for the placenta. Out of six 
full-length env genes of the young HERV-K(HML-2) 
family, the env encoded by HERV-K108 was also shown 
fusogenic in human embryonic kidney cells as well as in 
mouse, hamster and cat cells (53). Similarly, the rest of 18 
intact env ORFs, which did not score yet in cell fusion 
assays, might be proven as biologically relevant in the 
future. Description of their splicing in various tissues 
could identify particular cell types where these retroviral 
glycoproteins play any role in cell functioning. 

The striking contrast of splicing efficiency in HeLa cells 
as an example of epithelial tumors and in germ cell tumors 
including choriocarcinoma cell lines (Figures 2 and 3) can 
be used to analyze the mechanism controling endogenous 
ERVWE1 pre-mRNA splicing. Cell-specific splicing 
factors can be involved but our results point to the role 
of chromatin context and, in particular, the modifications 
of histones associated with exon/intron boundary. At least 
partially efficient splicing in HeLa cells can be observed 
after the stable integration of the unmethylated cloned 
ERVWE1 provirus (Figure 4) into a new chromosomal 
position different from the endogenous locus at 7q21-22. 
The availability of trophoblast-specific transcription 
factors binding to the URE sequence could further increase 
the efficiency of splicing in stably transfected HeLa, and 
experiments with ectopic expression of GCMa are highly 
needed to explain the control of ERVWE1 splicing. The 
requirement of chromatin context results probably from 
the co-transcriptional occurence of pre-mRNA splicing 
and elongation rate dependent on the chromatin configur- 
ation. RNA polymerase II and transcription activators 
associate with splicing factors (31,54) and slow elongation 
through the intron/exon boundary or even stalled RNA 
polymerase II gives more time to recognize the splicing 
signals, which is particularly important for the suboptimal 
signals used for alternative exons inclusion (54-56). In our 
study, we have shown the high density of H3K36me3 
along the intron-exon boundary of ERVWE1 in Be Wo 
cells with high expression and efficient splicing. This is 
rather unusual as most studies demonstrated 3'-exon en- 
richment of H3K36me3. The 5'-intronic sequence in 



retroviruses, however, is of coding potential in non-spliced 
transcripts and behaves as an alternative exon. A more 
extensive study along the provirus ERVWE1 also 
comprising the 2.1 kb intron-like insertion could elucidate 
this point. Our approach using the HeLa cell clones with 
various expression of ectopic proviruses could also help to 
better understand the blocks of ERVWE1 transcription 
and splicing; we, however, need the exact analysis of 
copy numbers and genomic positions of proviruses in in- 
dividual clones. 
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